Statistical Modeling: The Two Cultures

نویسنده

  • Leo Breiman
چکیده

There are two cultures in the use of statistical modeling to reach conclusions from data. One assumes that the data are generated by a given stochastic data model. The other uses algorithmic models and treats the data mechanism as unknown. The statistical community has been committed to the almost exclusive use of data models. This commitment has led to irrelevant theory, questionable conclusions, and has kept statisticians from working on a large range of interesting current problems. Algorithmic modeling, both in theory and practice, has developed rapidly in fields outside statistics. It can be used both on large complex data sets and as a more accurate and informative alternative to data modeling on smaller data sets. If our goal as a field is to use data to solve problems, then we need to move away from exclusive dependence on data models and adopt a more diverse set of tools.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Society with Statistical Mechanics: an Application to Cultural Contact and Immigration

We introduce a general modeling framework to predict the outcomes, at the population level, of individual psychology and behavior. The framework prescribes that researchers build a cost function that embodies knowledge of what trait values (opinions, behaviors, etc.) are favored by individual interactions under given social conditions. Predictions at the population level are then drawn using me...

متن کامل

Statistical physics modeling of equilibrium adsorption of cadmium ions onto activated carbon, chitosan and chitosan/activated carbon composite

The adsorption ability of activated carbon, chitosan, and chitosan/activated carbon composite for cadmium separation from aqueous solution was analyzed via statistical physical modeling. The equilibrium data were analyzed by Langmuir, Hill, double layer model, and the multi-layer model with saturation isotherm models. Results showed that the multi-layer model with saturation could well describe...

متن کامل

spatial modeling of summer precipitation in North-west of Iran

In the present study, the main aim was the spatial evaluation summer rainfall of northwest of Iran based on30 stations in northwest of Iran during 30 years of statistical period (1985-2014). An attempt, using geo-statistical modeling by ordinary least squares (OLS) and geographically weighted regression (GWR) procedures, was also made. The results represented that the GWR model with higher S2, ...

متن کامل

Capturing and Categorizing Mental Models of Food Webs using QCM

This paper examines the use of qualitative representations in modeling the similarities and differences in causal reasoning for biological kinds between Menominee Native Americans and US majority culture. Qualitative Concept Maps are used for modeling and analyzing transcripts of interviews conducted with these groups. The individual models are used to construct generalizations for the groups, ...

متن کامل

A Statistical Study of two Diffusion Processes on Torus and Their Applications

Diffusion Processes such as Brownian motions and Ornstein-Uhlenbeck processes are the classes of stochastic processes that have been investigated by researchers in various disciplines including biological sciences. It is usually assumed that the outcomes of these processes are laid on the Euclidean spaces. However, some data in physical, chemical and biological phenomena indicate that they cann...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001